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Abstract: Searching for the dynamical foundations of Havrda-Charvat/Daroczy/ 

Cressie-Read/Tsallis non-additive entropy, we come across a covariant quantity called, 
alternatively, a generalized Ricci curvature, an A^-Ricci curvature or a Bakry-Emery-Ricci 
curvature in the configuration/phase space of a system. We explore some of the implications 
of this tensor and its associated curvature and present a connection with the non-additive 
entropy under investigation. We present an isoperimetric interpretation of the non-extensive 
parameter and comment on further features of the system that can be probed through 
this tensor. 

Keywords: non-extensive entropy, Bakry-Emery-Ricci tensor, optimal transport, 

isoperimetric inequalities 


1. Introduction 

Havrda-Charvat [1] / Daroczy [2] / Cressie-Read [3,4] / Tsallis [5,6] entropy is single parameter 
family of functionals, which have attracted some interest in the Statistical Mechanics community over 
the last 25 years. To go straight to the point, assume that a discrete set of outcomes is labelled the set I. 
Assume that each outcome is labelled by / G / and the corresponding probability of its occurrence is 
indicated by Pi. Then the non-additive entropy Sq that we will be interested us in this work, is defined by 



( 1 ) 


Here ks stands for the Boltzmann constant which will be set /cs = 1 almost everywhere in the sequel. 
The naive extension to a continuous set of outcomes, is characterised by the probability density function 
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p : f2 —)■ R, and is assumed to be absolutely continuous everywhere on with respect to the Lebesgue 
measure (volume) dvolu is 



( 2 ) 


In both of the above expressions g G R, although a recent work has suggested [7] the possibility of 
g G C. We called the definition of Sq for continuous sets of outcomes (2) as “naive” since there has 
been some recent controversy about the validity of (2) [8-15] and some related skepticism on whether it 
is possible to extend Sq to continuous sets of outcomes. We consider this criticism to be valid and the 
general controversy not yet settled. Nevertheless, in the absence of a viable alternative or a consensus, we 
will use (2) in the sequel as the version of Sq for a continuous sets of outcomes, having the above possible 
caveat in mind. It is reassuring to notice that for g —1 one recovers the Boltzmann/Gibbs/Shannon 
(BGS) entropic functional 



(3) 


A question of fundamental importance for any entropic functional is to determine its dynamical 
foundations. More specifically, to determine the microscopic dynamical systems whose collective 
behaviour is encoded by the entropic functional under consideration. Moreover one can ask whether 
is it possible, even in principle for someone to predict which particular entropic functional, or less 
ambitiously, which are the common features of a class of entropic functionals that can effectively 
describe the collective behaviour given a microscopic dynamical system. In our considerations, despite 
the fact that such an assumption can be considered substantially, or even unnecessarily, restrictive, 
we will always have in mind Hamiltonian systems of many degrees of freedom. The dynamical 
behaviour of such systems is described by their evolution in their configuration or phase space. Such 
a space is a manifold M endowed with a Riemannian metric g. There may be additional structures 
present, such as the symplectic structure in phase space oj whose presence has unexpected and profound 
consequences [16]. However in this work, we will use exclusively the underlying Riemannian manifold 
(M, g) and ignore any further structures that may be present. 

In Section 2, we discuss the basics of Ricci curvature on Riemannian manifolds and the generalized 
/ N- / Bakry-Emery Ricci curvature and the related tensor constructed via optimal transport. In Section 3, 
we discuss the connection of the generalised Ricci curvature to Sq through a gradient flow and through 
the behaviour of the functions belonging to the displacement convexity classes VCm- In Section 4, 
we present an isoperimetric interpretation of the non-extensive parameter g. We also present an 
interpretation of g in terms of a projection arising from coupling the system to an external “thermostat”. 
In Section 5, we present a brief assessment of the current situation. 
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2. About the Ricci Curvature and Its Generalizations 

2.1. Geometry in Mechanics 

Consider an autonomous dynamical system of n interacting point particles, having mass rrii, i = 
1,... ,n. Let its configuration space M be parametrized by the local coordinates {x\x^}, i,j = 
1,... n and let its Lagrangian be 

1 ” 

^ = -'^mi{x"f-V{x\... ,x"^,x\... ,x"^) (4) 

i=l 

The dot stands for the derivative with respect to some parameter (“time”). To make the analysis more 
tractable, we simplify (4) by assuming generalized velocity-independent interactions, namely 

1/ = L(K}), z = l,...,n (5) 

This potential form is quite restrictive as it does not cover the case of point particles interacting 
with Electromagnetic or Yang-Mills fields. However, it turns out that such interactions can be 
straightforwardly incorporated by following a similar approach for the tangent bundle TM or more 
general, model-dependent, principal or associated bundles related to M. 

However, it turns out that such interactions can be straightforwardly incorporated by following a 
similar approach. 

If we wish to be slightly more general, by possibly incorporating the effect of constraints and reducing 
the system to truly independent variables (without constraints) even locally, we can use instead of (4) 

1 

where Mij stands for a positive definite quadratic form (mass matrix) which can be used as a 
Riemannian metric on M. In the above case (6), when all masses of particles are taken as equal to 
each other 

Mij = 5ij ( 7 ) 

we get back (4). Since the system is autonomous, with canonical momenta pi = bCjbxl, its Hamiltonian 

1 ” 

^ = -^Y.^\^V{x\...,x^) ( 8 ) 

i=\ 

corresponding to total energy E of the system, is an integral of motion. Maupertuis’ principle, recast in 
Hamiltonian terms, states that the motions of the system are the stationary paths in M of the adiabatic 
functional 

lF[c\ = Pi x^ dt (9) 

Jc{t) 

where c : [0,1] —)■ T*M, are rectifiable curves with fixed initial and final points having total energy 
EL = E. Maupertuis’ principle, as generalized by Hamilton, states that physical motions are extremals 
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of the length funetional (9) of sueh eurves c. In geometrie terms, sueh extrema are geodesies whose 
loeal eharaeterisation in terms of the eoordinate eomponents of the metrie 


g{di,dj) = gij, i,j,= l....,n 


( 10 ) 


are geodesies on the level sets of energy £' of M. The geodesies equation in sueh loeal eoordinates is 


j dx^ dx^ 
ds^ ds ds 


( 11 ) 


where the Christoffel symbols (eonneetion eoeffieients) are given, as usual, in terms of the metrie 
tensor eomponents gij by 

{didik + dkgji - digjk) ( 12 ) 

where di = djdx^ and the Einstein summation eonvention is assumed over repeated indiees in (9)-(12) 
and heneeforth, unless otherwise stated. 

Here, and in the sequel, s represents the are-length of a eurve c whieh is assumed to be are-length 
parametrised. An alternative to the Mij metrie, is the Jaeobi metrie 


g,, = 2{E-V{x\...,x^)}Mij 


(13) 


for whieh the geodesie equation (11) reduees to Newton’s Seeond Law 

dV 

dt"^ dxi 


(14) 


In elassieal Statistieal Meehanies we eonsider dynamieal systems having sets of “nearby” initial 
eonditions. Phrased differently, we are not eonsidering just one partieular geodesie Cq to deseribe 
the evolution of the system, but rather a set of them, all of whieh are initially elose to cq. The behavior 
of nearby geodesies is eneoded, in the linear approximation, via the Jaeobi fields with eoordinate 
eomponents J\i = 1,... ,n. In their dynamieal evolution, the infinitesimal separation between nearby 
geodesie is determined by the by the geodesie deviation (Jaeobi) equation. This is a linearization of a 
variation of the equation of geodesies and, expressed in the loeal eoordinates we are using, it is 


^ - n 

ds^ ds ds 


(15) 


where R^jki are the eomponents of the Riemann tensor in the eoordinate basis of TM. 


2.2. Rudiments of Riemannian Curvature 

The Riemann tensor R with eomponents in a eoordinate basis indieated by R^jki is a fundamental 
objeet in Riemannian geometry. It should be noted that despite a eentury and a half of intense exploration, 
many of its properties still remain unknown [17]. A way to motivate its introduetion is by searehing for 
a loeal quantity that allows us to determine the distanee between two points in a Riemannian manifold. 
Sueh a ealeulation is praetieally intraetable, as ean be seen by looking at the geodesie equation (11) in 
eonjunetion with (12). The Riemann tensor is a quantity that allows us to infinitesimally address sueh a 
question. This viewpoint, among several others, ean be seen in [17]. 
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The components of the Riemann tensor are explicitly given, in terms of the metric, by 

- d,T\, + (i6) 

The Jacobi field with coordinate components J\ i = 1,... ,n points from cq toward nearby 
geodesics, as noted above, but may also point along cq itself. The definition of the Riemann tensor 
is far more elegant and brings forth its linear and differential nature when expressed in terms of the 
Levi-Civita connection V (the unique symmetric connection preserving the metric) as a multi-linear 
map R : TM x TM x TM TM 

R{X,Y)Z = VxVy^ — Vy ~ V[x,y]-Z^ (17) 

where X,Y, Z G TM and [-, ■] indicates the Lie Bracket on TM. 

By taking advantage of (15), T can be chosen to be perpendicular to cq. We see from (16), that 
the behavior of nearby geodesics is controlled by the Riemann curvature tensor (17). Consideration of 
initial conditions near cq amounts to averaging in the n-1 perpendicular directions to Cq, as expressed by 
J\ i = 2,... ,n, by using a suitably chosen measure. The simplest case is to choose a uniform measure, 
meaning a measure which is a constant multiple of the Lebesgue measure (“Riemannian volume”) dvol. 
Then evolution of a set of initial conditions along cq is controlled by the average of the Riemann tensor 
in the 2-planes spanned by the tangent to cq and one of the perpendicular directions to cq, an average 
that results in the Ricci tensor along cq. Therefore, the Ricci tensor is given by the following, essentially 
unique, contraction of the Riemann tensor 


R., = R% (18) 

For geometric purposes, one uses the following far more transparent, but essentially equivalent as 
can be proved via polarisation, scalar quantities. Their formulation is slightly easier on orthonormal 
bases. So, consider an orthonormal basis of TM with respect to the metric tensor g indicated by 
{cj}, i = 1,... ,n. Let ei be the tangent to c. The sectional curvature in a 2-plane of TM at P G M 
spanned by Cp, e^, p ^ q, p, g = 1,..., n is defined by 

kp{ep,eg) = Rijki{epy{eqy{ep)^{egy (19) 

without a summation over p, q. In a slightly different notation, following the notation of (17), we can 
express the sectional curvature as 

kp{ep,eq) = g{R{ep,eq)eq,ep) (20) 


where no summation over p or q takes place, once more. 

As defined, the sectional curvature is formally a function on the Grassmann manifold G 2 ,n(M) of 
2-dimensional planes of TM. So, the sectional curvature is, at its core, an essentially 2-dimensional 
quantity. Its geometric meaning becomes evident through the following two theorems which are true in 
2-dimensions. Let P be a point of the Riemannian manifold M and let r be the radius of a geodesic 
circle around P. Let c(r) be its circumference and A(r) be the area of the geodesic disk whose boundary 
has length c(r). Then [18] Bertrand-Puiseux (1848) 


k p 


lim 3 ■ 

1—^0 


27rr — c(r) 
vrr^ 


( 21 ) 
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Essentially equivalently [18] (Diquet 1848) 


k p 


lim 12 ■ 

r —>-0 


vrr^ — A{r) 

7rr^ 


( 22 ) 


According to these theorems one sees that the sectional curvature determines how much the volume of 
a sphere or ball exceeds that of the corresponding sphere or ball in flat space. There are several other, 
equivalent, formulations of this geometric fact [19,20]. 


2.3. About the Ricci Curvature 

The Ricci curvature in the direction of ei, which as was stated above is assumed to be tangent to the 
geodesic c passing by P, is a symmetric bilinear form related to the Ricci tensor by 

Ric{ei,ei) = i?ij(ei)*(ei)-^ (23) 

In other words, it is given in terms of the sectional curvature at P G M, 

n 

Ric{ei,ei) = ^ kp{ei,ei) (24) 

i=2 

This makes evident the fact that the Ricci curvature is the outcome of averaging in all directions 
perpendicular to ei at P as was previously mentioned. This averaging is also explicit in the contraction 
in the indices of the Riemann tensor resulting in the Ricci tensor (18). Hence the definition of the Ricci 
tensor or Ricci curvature uses explicitly two distinct facets of Riemannian manifolds: their metric (and 
connection which is uniquely defined through the metric) and a measure (in this case the volume). These 
two concepts are uniquely inter-related in Riemannian manifolds exactly because such manifolds M are 
locally isometric, to first order, to the Euclidean space [17]. This can be most easily seen via the 
expansion of the metric in geodesic normal coordinates around P G M taken as the origin of the normal 
coordinate system 

9pq{{x"}) = Spq + ^Rprqs{P) + 0{\\x\f) (25) 

One can define the volume in Riemannian spaces via the following two requirements [17] 

• Eor surjective, distance-decreasing maps / : Mi —)■ M 2 , the volume obeys fo/Mi < no/M 2 . 

• The volume of the unit cube [0,1]” in R” is normalized so that vol ([0,1]”) = 1. 
and then prove the more familiar formula in local coordinates 

dvol = ^ dx, g = det gij (26) 

There are several ways for someone to understand the geometric meaning of Ricci curvature 
[17,19-21]. A very common one, which will be useful in the sequel, is through the Bishop-Gromov 
volume comparison theorem. The main idea behind comparison theorems [17,19-24] is that even though 
it is quite hard to analyze geometric features of manifolds by directly solving the partial differential 
equations equations resulting from infinitesimal considerations, such as the geodesic (11) or the Jacobi 
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equations (15), one may still be able to extract useful geometric and topological information about such 
manifolds by comparing them to “simpler” spaces. As such “model” spaces one usually considers the 
simply-connected manifolds of constant sectional curvature (“space forms”). In this spirit, one tries to 
establish inequalities, bounding the quantities of the manifold of interest by the corresponding ones of 
the model space. Such ideas can also be extended to more general metric spaces, quite frequently also 
endowed with measures [24]. This “comparison" idea is not foreign in Physics, where occasionally one 
encounters such inequalities, especially in more rigorous treatments of Statistical Mechanics [25,26]. 

There are different versions of the Bishop-Gromov volume comparison theorem expressing the same 
idea slightly differently. For our purposes we use the following: Assume that (M, g) is a complete 
Riemannian manifold with Ric > (n — l)fc, fc G R and P G M an arbitrarily chosen point. Let Bp{r) 
be the geodesic ball of radius r centered at P and Bk{r) be the ball of radius r in the space form of 
constant sectional curvature k and dimension n. Then the function 

vol Pfc(r) 

is a non-increasing function of r. This volume ratio approaches 1 as r 0 due to (25). An immediate 
corollary of this is (Bishop’s inequality) 


vol Bp{r) < vol Bkir) 


(28) 


From a physical (General Relativistic) viewpoint the Bishop-Gromov volume comparison theorem (27) 
is “obvious”. Let Seal indicate the scalar curvature, namely the unique contraction of the Ricci tensor 
Seal = Einstein’s equations 


2 _ SttG 

^ ^CQjL J- 


(29) 


n m 

determine the metric in terms of the stress energy tensor T^y. Here c stands for the speed of light 
and G is the universal gravitational constant. Roughly speaking, in areas where there is a lot of matter, 
T^y will result in strong gravity, so the bound in the Bishop-Gromov inequality will be high. Consider 
a homogeneous and isotropic space-time, as in the case of the Friedmann-Robertson-Walker (FRW) 
cosmology, as a comparison space. Any space-time that has more attractive matter than that, will make 
the geodesics converge faster, as seen also in (21), hence the volume of a ball centered along one such 
member of the geodesic congruence will contract faster than in the model (FRW) space. The above 
analogy relies in the fact that the results we use in the Riemannian signature carry over to the Lorentzian 
signature case, something that is known, even though non-trivial to prove [27]. 

A second implication of Ricci curvature, still in the context or Comparison Geometry, and of potential 
interest in Physics is via Lichnerowicz’s inequality [28]. This provides a lower bound on the lowest 
non-trivial eigenvalue Ai of the Laplacian A in terms of a lower bound on the Ricci curvature. As a 
reminder, the Laplacian A on functions / : M —R on a Riemannian manifold (M, g) is defined to be 
the trace of the Hessian 

A = tr (Hess) (30) 

or expressed in coordinates {x®, i = 1,..., n} in terms of the components of the metric gij 

A/ = 


(31) 
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where summation over repeated indiees is assumed. The signifieanee of Ai for Physios is substantial: 
in the partioular eontext of Lagrangian field theories, it expresses the lowest exeitation of the mass 
speotrum, or alternatively and depending on the interpretation, the mass of the lightest partiole in the 
partiole speotrum. Sinoe it is not praetioally feasible to explioitly oaleulate Ai even for the simplest 
manifolds, with soant few exoeptions, providing bounds to it is the best that someone ean hope for. 
Liehnerowioz’s theorem states that for (M, g) a compaot Riemannian manifold without boundary with 

Rij > kgij (32) 

for R 9 fc > 0, then Ai satisfies 

71 

Ai > - -k (33) 

n — 1 

It is worth observing that the right hand side is Ai of the sphere endowed with the round metrio, 
having oonstant ourvature k/{n — 1). A result by Obata [29] states that the equality is attained if and 
only if (M, g) is aetually isometrie to sueh a round sphere. This result ean be interpreted as stating that 
knowing the mass of the lowest exeitation, alongside the Ricci curvature lower bound (32), uniquely 
determines the whole spectrum of excitations in this space(-time). It actually determines much more 
than that: it completely determines the geometry and the topology of such a space(-time). In General 
Relativity, or diffeomorphism-invariant theories, the Ricci curvature bound (31) usually results from 
imposing a strong energy condition [27], namely a lower bound to 

(t,. - ^ 9,.) X'-y (34) 

where X,Y G TM with some additional causal behaviour assumed. Such a requirement (33) constrains 
the properties of what a “reasonable” mass-energy distribution in space(-time) should be allowed to 
have. In the above paragraph, we use the word “space(-time)” loosely, as we always assume that g has a 
positive-definite signature. 

2.4. Generalized Ricci Curvature 

There are several ways to go about generalising the above concepts. There is not any really unique 
extension of the Ricci curvature in more general metric-measure spaces, but several proposals, slightly 
different from each other exist. There are several different motivations for such generalizations. The most 
immediately pertinent one for our purposes, is the claim that Sq describes systems that are non-ergodic. 
As a result, in such cases one does not have available a uniqueness statement like the Krylov-Bogoliubov 
theorem [30], or a statement relating the asymptotic averages along a trajectory (or iterates of maps) and 
phase space (micro canonical) averages like Birkhoff’s ergodic theorem and its implications [30]. On 
the other hand, things may not be as uncontrollable as they may appear at first sight: indeed according 
to the Ergodic Decomposition theorem [30], every invariant Borel probability measure of a continuous 
map on a metrisable compact space can be decomposed into a sum of ergodic invariant probability 
measures each of which is supported in disjoint subsets of the whole space. In practical terms though, 
concretely determining such a decomposition into sets having ergodic measures may not be tractable. So, 
to proceed along these lines, we assume that a non-ergodic measure is absolutely continuous with respect 
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to the volume of the underlying eonfiguration / phase spaee M whose projeetion on the total energy E 
hyper-surfaees M^; gives rise to the miero-eanonieal measure. Henee it is of interest to determine a 
generalisation of the Rieei tensor that is defined with respeet to the measure 

dp = dvolM (35) 

rather than with respeet to the volume element, where / : M R. So, the question is to define a 
Rieei-like tensor that will infinitesimally eontrol the behaviour of sueh a dp as in (34). The easiest 
approaeh, used extensively in Physies, is to eonstruet, by hand, a simple two-index symmetrie tensor 
from the sealar / and from the Rieei tensor Rij. Symmetrie tensors eonstrueted from / and involving 
two derivatives are {dif){djf) as well as the Hessian 

Hess / = Vdf (36) 

whieh for X, F e TM amounts to 

Vdf{X,Y) = X{Y{f))-{VxY){f) (37) 

In loeal eoordinates x*, i = 1,... ,n the Hessian eanbe written as 

(Hess /),, = {d,d, - r%dk)f (38) 

where the summation eonvention upon repeated indiees is assumed. So the easiest ehoiee would be to 
define a generalised Rieei tensor {RN)ij by 

{RN)ij = Rij + (Hess f)ij (39) 

This definition is essentially due to Bakry and Emery [31] where iV is a letter whieh is just part of the 
notation, at this stage. The realisation that the term {dif){djf) ean also be ineluded with an arbitrary 
undetermined eoeffieient N G [1, oo) and the exploration of some of its eonsequenees was provided first 
by Qian [32]. Therefore, a more general definition of R^ is provided by 

(Riv)ii = Rij + (Hess f)ij - ^ df)ij (40) 

The exaet normalisation in front of the last term is a matter of eonvention. What is not a matter of 
eonvention though, is that in this expression there is the undetermined eoeffieient N whieh has to be 
inserted by hand sinee it is not determined by any of the differential or algebraie (sueh as the Bianehi or 
the Palatini) identities that the Riemann tensor and its eontraetions obey. 

The geometrie origin of N is straightforward to pinpoint: in the Riemannian ease, the metrie uniquely 
determines the volume element and the Hausdorff dimension of the manifold expresses this unique 
relation between the metrie and the volume, as in the diseussion preeeding (26). In the ease of a general 
metrie-measure spaee, the measure is not related in any unique way to the metrie. In the ease of Finsler 
spaees for instanee, whieh from a partieular viewpoint are eonsidered to be a “direet" generalisation of 
the Riemannian manifolds, no unique measure is preferred, so several have been proposed depending 
on one’s goals [24,33]. Therefore, the effeetive dimension of the measure dp (35) in a metrie-measure 
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space cannot be captured by the Hausdorff dimension n of M, since n is really expressed in terms of the 
volume of M. Hence the effective dimension of dp, is an additional piece of information that has to be 
provided, a priori. 

One can put together the definitions (39) and (40) by observing that limAr_).oo of (40) reduces to (39). 
Putting all these elements together, one can state that the central differential quantity for characterising 
the non-ergodic behaviour of systems conjecturally described macroscopically by Sg is the generalized- 

/ N- / Bakry-Emery- Ricci tensor defined by 

/ 

Rij + (Hess f)ij, if N = oo 


{RN)ij 


Rij + (Hess f)ij - 0 df)ij, if n < N < oo 

< 

Rij + (Hess f)ij — oo ■ {df 0 df)ij, if N = n 


(41) 


—oo, if iV < n 

\ 

where, by convention, oo ■ 0 = 0. The last two lines are put there for completeness and for having a 
unifying treatment of these extreme cases with the rest. The definition (40) is taken from the works of J. 
Lott and C. Villani, in particular [34,35]. We observe that the generalized Ricci tensor Rn is, in reality, 
a one-parameter family of tensors giving non-trivial results when N G [n, oo]. 

It may not come as a total surprise that such an extension of the Ricci tensor and the related curvature 
is not unique, see [36-38] for some alternatives, for instance. We will mention the synthetic definitions 
which reduce to (41) in the case of smooth measure spaces in the following Sections. The definition of 
(41) is a matter of choice, which however has many desirable properties [34,35]. Our interest to this 
choice having some more widespread, or even fundamental, significance comes from that it has been 
“re-discovered" and used independently in different contexts. An example of the former occasion is via 
the work of Bakry-Emery [31] and Qian [32]. The first two authors are interested in properties of the 
heat semigroup in the presence of external conservative forces which provide an additional drift term 
to the heat equation. Their analytical approach lead them to a Bochner formula whose further analysis 
lead to the definition of the curvature-dimension CD(k, n) condition. Details can be found in [31,39]. 
This condition expresses properties of manifolds of sectional curvature at least k and dimension at most 
n in a way which is strongly reminiscent of the Gromov pre-compactness theorem [24]. A second 
occasion, is the case of the extensive use of the N = oo expression in (41), in particular, in the Ricci flow 
leading to the proof of the Poincare, and consequently of Thurston’s geometrization, conjecture by G. 
Perelman [40-42] . A third occasion was the work of Chang, Gursky and Yang [43] who asked on whether 
one could define conformally invariant analogues of the Ricci and scalar curvatures on smooth metric 
measure spaces. The comparison and interpolation between [31] and [43] was taken up by Case [44,45], 
who has also contributed to the investigation of the quasi-Einstein metrics {RN)ij = ^dij resulting 
from the definition of (41) for N < oo. In the Physics literature, the generalised Ricci curvature 
has been recently discussed in explorations of scalar-tensor (e.g., dilaton, Brans-Dicke etc.) theories of 
gravity [46-48]. 

The geometric meaning of {RN)ij starts becoming clearer when one tries to check on whether, or 
under what conditions, standard results of Riemannian geometry can be extended to smooth metric 
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measure spaees [34,35,49,50]. A theorem in the spirit of the Bishop-Gromov volume eomparison goes 
as follows: Consider a eompaet, smooth metrie measure space (M, g, p) where following (35), 




dvol 


M 


lBp{r) 


(42) 


having (i?Ar)jj > kgij, k > 0 for some N G [1, cxd). Then for all 0 < r < i? and P G M one has 


EiBpjR)) ^ (R\ 

/i(Pp(r)) “ 


(43) 


The theorem actually holds for the class of measured length spaces, which are more general than 
Riemannian manifolds. Usually volume comparison theorems in this spirit also require some additional 
conditions (such as bounds, convexity properties etc.) on /. It is worth noticing that according to 
the Bishop-Gromov comparison theorem, the weighted volume of M does not expand faster than 
polynomially, with the exponent being exactly N. This justifies, to some extent, the statement made 
above that N can be seen a substitute for the Hausdorff, or as an effective dimension, of the measure p 
on M. 

Regarding Lichnerowicz’s inequality, the generalization goes as follows: Since the measure of 
integration changes from dvol to dp the Laplacian has to be modified if we want to keep it a self-adjoint 
operator with respect to dp. The new Laplacian is 


= A-g(V/,V-) 


(44) 


It is straightforward to check that this is indeed a self-adjoint operator with respect to the new measure 
dp, namely that for pi,<p 2 G L^(M, p), one has 


(piAfip2 dp 


g(V(^i, V</92) dp 


{Afipi)(p2 dp 


(45) 


JM JM JM 

It should be noted at this point that, physically, this is the Laplacian on functions of M but in an external 
field whose potential is /. Therefore, maintaining the self-adjoint property of the Laplacian amounts to 
modifying its sub-leading symbol by adding a drift term. Generalizing Lichnerowicz’s theorem one can 
prove that if {RN)ij > kgij, with /c > 0, then 


Ai > 


N 


N -I 


k 


(46) 


Comparing (46) with (33) we see, once more, the validity of the interpretation of A as a dimension of 
the measure p. It is quite interesting to notice that in the case of A = oo the resulting inequality is 
Xi > k which is independent of A. This has important implications in the sequel as the N = oo case 
will turn out to be related to Sbgs when a finite A is related to Sg. 

Having stated all the above, it appears, and correctly so, that the considerations are quite generic and 
could be satisfied by any functional form related to non-ergodicity. This is true, in part. It turns out 
that an infinity of entropic functionals could play a significant role, in this respect. The functional Sg 
has some special significance even among them, although it is not unique. How these elements work 
together, will be discussed in the next Section. 
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3. Ricci Curvature via Optimal Transport 

There are several threads of development leading to the relation between Sg and the generalised Rieei 
eurvature Rjv. For a eomprehensive treatment of these topies, one ean start by eonsulting [35]. 

3.1. Otto’s View: the Porous Medium Equation and the Geometry of Space of Probability Distributions 

An influential idea that eonnects Sg with the above developments is due to Otto [51]. He studied the 
set of solutions of the porous medium equation in 


dtp-Ap^ = 0 


(47) 


where p > 0 is a density function on which explicitly depends on time t G [0, oo). In this equation 
m > and m > for reasons that will be stated later. The question that arose was how to interpret 
the porous medium equation as a gradient flow equation. As a reminder, a gradient flow [40^2,52] 
of sufficient generality for our purposes, consists of a Riemannian manifold (M, g) and an “energy” 
functional E[p\ on (M, g) obeying the autonomous differential equation 


dtp = -VE[p] 


(48) 


A simple example of such a gradient flow is the heat/diffusion equation where E[p\ is the kinetic energy 
(Dirichlet functional) of p. The interesting property of a gradient flow is that E[p\ decreases along the 
actual trajectories p{t). The established practice until [51] for the porous medium equation (47), was 
the following: The manifold M of its solutions was taken to be 


M = <^ p : R"' —)■ R_|_ such that / p dvol = 1 

7r" 


(49) 


where dvol indicates the infinitesimal volume element on the space we are integrating over (in this case 
R"). Its tangent space at p G M, TpM, is 


TpM = < s : R” —)■ R, such that / s dvol = 0 


'R" 


(50) 


The tangent space TpM can also be seen as 

TpM = {0:R”^R}/~ (51) 

where ~ identified f differing by an additive constant and where f were solutions of the Poisson equation 

— A0 = s (52) 

The metric tensor g at p G M was taken to be 


gp( 01 > 2 ) = / (yffiV 4 ) 2 ) dvol 


(53) 


'R" 


The “energy” functional was taken to be 


E[p] = 


p’”+‘ dvol 


m + 1 


'R” 


(54) 
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Otto’s view was to keep (49)-(51) as they were, but ehange (52) to requiring 0 to satisfy 


V • (pV0) = s 


( 55 ) 


and modify the metrie g on M from (53) to 



JR" 


(56) 


One ean immediately see from (49), (53), (56) that (M, g) is an infinite dimensional Riemannian 
manifold, so appropriate eare should be taken of eonvergence issues in a more eareful, less formal, 
treatment. The “energy” functional was modified from (54), by Otto, to be 




(57) 


p log p dvol for m = 1 


We recognise right away that the “energy” functional in (57) has the exact same form as the entropy Sq if 
q is identified with m. We also see that plays the role of the “energy” functional in the gradient flow 
characterisation of the ordinary heat/diffusion equation which corresponds to m = 1 in (47). Moreover 
we see the prominent role that the modified measure p dvol, which is exactly (35) with p = exp(—/), 
plays in the viewpoint advocated in [51]. Otto went much further along these lines. He established a 
formal calculus on (M, g), dubbed “Otto calculus”, without paying too much attention to the subtleties 
arising from the fact that (M, g) is infinite-dimensional. He discovered, for instance, that the manifold 
(M, g) has formally non-negative sectional curvature. Addressing more carefully some of these topics 
was undertaken by Lott [53], among many others [34,54-56] and several aspects are still under intense 
investigation. 

The long-term asymptotic behaviour of the porous medium equation is expressed by the Barenblatt 
solution ps. To determine the scaling behaviour of a solution in the asymptotic regime close to pB, we 
re-express it as 



i = 1,... ,n 


(58) 


2 + n{m — 1) 

One can prove that p satisfies the gradient flow equation, where r = log t 


(59) 


drP = -VE[p\ 


(60) 


where the modified energy functional E is given by 


E[p] = EIp] + aV[i^ 


(61) 


and where V is represents the second moment functional of p 
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with |xp being the Euclidean norm of {x*, i = 1,... n} G R". Otto proved [51] that E is uniformly 
strictly convex on (M, g) since it satisfies 

Hess E[p] > al, V p G M (63) 

and moreover that 

^ < 0 (64) 

An interpretation of these results in the context of Sg is as follows: First of all, the convergence to the 
Barenblatt solution, which we have not explicitly stated, but whose scaling is the same as that of (58) 
can be traced back to the origins of Sg in the Physics literature [5,6]: fractals usually do have some form 
of self-similarity which is expressed by scaling properties. Second: the ad hoc constraints on the value 
of m stated above, can now be seen as requirements for the convexity of E and for the well-posedness 
and finiteness at pb of E and V. Third, we observe in (64) that the rate of convergence of p to pB is 
polynomial (power-law) in terms of t. This, conjecturally, is one of the important properties of systems 
described by Sg: their power law rate of convergence to their long-time asymptotic limits [6]. It is worth 
mentioning that this rate of convergence is believed to also have the form of a g-exponential 

eg{x) = {l + (l-g)x}^, 1 7 ^ g G R (65) 

However, the value of the non-extensive parameter controlling the rate of approach to equilibrium in 
the g exponent distribution is not necessarily the same as that in the equilibrium distribution itself [6]. 
There is no a priori reason why these two distributions that share the same functional form should also 
have the same non-extensive parameter. In the absence of any physical arguments to the contrary, we 
would expect that different aspects of the systems under consideration are described by different values 
of the non-extensive parameter, if we can even assume that their macroscopic behaviour is described by 
Sg in the first place. Moreover, if the renormalisation group lessons are taken at heart, it is also entirely 
possible that g, for a given system, may be different for the approach to different quasi-equilibrium states. 

Leaving these possibilities on the side, we can also see in (64) an aspect related to the existence of 
the observed, in numerical simulations, quasi-equilibrium states [57-59]. If the approach to equilibrium 
is polynomial, then it is too slow, with respect to an exponential time scale that we usually employ in 
analyzing the approach of systems to equilibrium. This way of thinking is similar to what we advocated 
in [60,61], for justifying why the largest Lyapunov exponent of systems described by Sg is zero. In 
closing, we would like to refer to [62] for a comprehensive treatment of the porous medium equation, 
including the Barenblatt solution stated above, and [63] for properties of some of its possible fractional 
generalisations. 

3.2. Optimal Transportation and Wasserstein Spaces 

In a seemingly different line of development from the above, the definition of the generalized Ricci 
curvature and its relation to Sg resulted from attempts to address aspects of the Monge optimal transport 
problem. This problem ascribed to Monge [64] tries to determine the least costly way of transporting 
a quantity of soil/earth (“deblais”) from a pile to an excavation/embankment (“remblais”). Naturally, 
one has to specify before-hand what is the cost function c which is used to determine such a total 
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cost. Although the problem was initially posed in R"^, it ean elearly be formulated for general eomplete 
separable metrie (Polish) spaees endowed with Borel probably measures. It may be worth mentioning 
right away, that the general problem remains unsolved even for simple eost funetions in R"^, although 
substantial progress has been made in speeial cases, espeeially during the last two deeades. 

The most obvious eost function c with a geometric significance, is the Euelidean distanee d{x,y) 
between of the points x in the pile and points y in the exeavation. This choice of eost funetion c{x, y) 
turned out to be diffieult to deal with, the root eause being its laek of uniform eonvexity properties. To 
get a simpler problem without deviating too much from the original, one uses instead as cost function 


c{x,y) 


1 

2 


d^{x,y) 


( 66 ) 


which turns out to be more manageable as it is uniformly eonvex in R"^ x R”. 

Monge’s transport problem can be mathematieally formulated as follows [34,35,65-67] (these are 
general referenees for this whole subsection): consider two Borel measures on R"^, or in more 

general Polish measure spaees mentioned above, sueh that 


/i+(R”) = /i'(R”) 


(67) 


both of which are finite. In addition, consider the elass of Borel maps / : R"^ —)■ R” pushing forward 
/r+ to pT namely 

(/,fi+)(/l) = ,^+(/-M), V/lcR" (68) 

This ean be translated as requiring 

[ w{f{x))dp+{x) = j w{y)dp-{y) (69) 

J J ■^‘2 

for all eontinuous functions w : R” —R with Ai, A 2 C R” indicating the support of /i+, p~ 
respeetively. Let the set of sueh funetions be indieated by IF. Given a eost function c : R” x R” —)■ [0, cx)) 
consider the total eost funetional 

^m[/] = [ c{x,f{x))dp^{x) (70) 

Jr" 

The goal is to determine the existenee and find the properties of an optimal transport fo & (F sueh that 

dMifo] = min luif] (71) 

f 

This problem has proved to be quite difficult to address. There are at least three reasons for this 

• The problem is highly non-linear. To see this more concretely, let’s assume that both 
are absolutely eontinuous with respeet to the volume element of R" with corresponding 
Radon-Nikodym densities p~■ Then it turns out that the push forward eondition (68) translates 
into the Monge-Ampere, non-linear, equation 

= p-(f(x)) det(V/) (72) 


where det stands for the Jaeobian determinant of the differential map V/. 
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• Such a solution may not exist: consider for instance to be the Dirac delta function but not p . 

• The transport condition (69) is not weakly sequentially closed and this creates the additional 
complication that the minimum (71) may not be realized, as subsequences of approximating maps 
may not converge in any reasonably weak topology. 

Progress toward solving Monge’s problem was slow until Kantorovich [68,69] substantially 
reformulated it in more amenable terms. The difficulties associated with the Monge problem are partly 
attributable to the fact that the optimal transport map the fo that we seek cannot “split” the measure 
Heuristically, f E cannot move a part of in a location of and another part in some other 
location of R”, as such a map would not be well-defined. Such a constraint (68) is too strong to handle. 
Instead, Kantorovich made the following two modifications. First, he transformed the problem into a 
linear one, by recasting it as follows: Consider the space R(R” x R"^) of Borel probability measures p 
whose push-forwards pi,p 2 on the first and second R” factors respectively are and 

V = {/r : Probability measures on R” X R" such that (pi)(t/i = (p 2 )tt= E~} (^3) 

The elements of V are called transference plans. Consider then, for p E V the modified cost functional 

IxifA = / c{x,y) dp,{x,y) (74) 

This functional is linear in p so if some assumptions on the cost function can be made, compactness can 
ascertain the existence of at least one minimizing measure, called optimal transference plan, po & V 

Ik[Po] = min Ik[p] (75) 

li&V 

It should be stressed at this point that there is no a priori guarantee that an optimal transference plan po 
of the Kantorovich functional Ik (75) corresponds to a mapping fo sought after in Monge’s functional 
Im (71). 

The second crucial observation allowing to make progress in solving the Kantorovich problem is the 
Kantorovich - Koopmans [68-70] duality: in the spirit of duality in convex analysis and geometry they 
reformulated the variational problem (74) so that instead of a minimisation, it became a maximisation 
problem. To formulate this dual variational problem, define the set 

M. = X R'"' R- continuous, 0(a:) +'f{y) < c{x,y)} (76) 

Then, introduce the dual of the Kantorovich functional Ik* as 

7ir*[0,V'] = [ (p{x)dp'^{x) + j fj{y)dp~{y) (77) 

7r’^ 7r" 

The Kantorovich-Koopmans duality is the statement that the minimisation problem (75) is equivalent to 
the maximisation 

lK*[<Po,fJo] = max lK*[(l),fj] (78) 

This problem is amenable to the methods of linear programming and as such it is easier to solve than the 
original (70). What is still not clear though is whether a solution of (78) can be derived from a solution 


Entropy 2015 ,17 


17 


of (71). The converse is clearly true: suppose that fo is a minimizer of (71). Then Id x /„ : R" — 
R” X R"^ and p = (Id x fo)^p'^ G V minimizes Ik in (74) so it is a solution to (75). 

Let’s be more general in this paragraph and assume that M is a Polish space (complete and separable 
metric space) endowed with a distance function d and let p G R(M x M) be a transference plan with 
marginals p'^ and p~ as above. Moreover, let’s assume that M is compact, for simplicity. Let’s choose 
as cost function 

c{x,y) = dF{x,y), p>l (79) 

The Kantorovich functional (74) with such a cost becomes 

^k[p] = [ d^{x,y)dp{x,y) (80) 

7mxm 

Then (80) can be considered as a measure of the discrepancy between the marginals p^, p~. Naturally, 
there are many ways to define the discrepancy between measures [71], depending on one’s goals. For our 
purposes, the optimal transference plan (75) with the cost function (79), in other words the minimizer 
of (80), enters naturally in the picture. It is interesting to notice that such an optimal transference plan 
always exists under the above assumptions [35] so the variational problem (75) does have an actual 
solution. If, in addition one sets 

{Wp{p+,p-)Y = Ik[Po] (81) 

or, in other words, if one defines for any two Borel measures pi, p2 G P(M x M) 

Wp{pi,p 2 ) = (min / d^{x,y) dp{pi,p 2 )] (82) 

Jmxm / 

where the minimum is taken over all possible transference plans p G V with marginals pi, p 2 , then it 
can be immediately checked that Wp{pi, P2) satisfies all the properties of a distance function. As such, 
it is called the p- Monge-Kantorovich-Rubinstein-Vasherstein distance, or in short and even though it is 
a partial misnomer, the p-Wasserstein distance (metric). All these metrics, except W^o, give the same 
topology on a compact M which is the weak-* topology. For such a distance function to be non-trivial 
the integral in (82) must converge. This becomes quite important especially when M is non-compact. 
Hence, the definition of the p-Wasserstein distance restricts the elements of V to only the ones with 
finite p-moments. The number of converging moments required for each case of p is the only essential 
difference between the different Wp metrics on V. The subspace 'Pp(M x M) of P(M x M) of measures 
with converging p-moments, endowed with the Wasserstein distance then becomes a metric space with 
several desirable properties [35] and is called the p-Wasserstein space of M. We forego expanding on 
this topic of current research as further information is not needed in the subsequent discussion.. 

3.3. The Brenier Map and Its Extensions: the Role of Convexity 

We go back to R"^, initially at least, and to assuming that c{x, y) = \d^{x, y) where d denotes the 
Euclidean distance in R"^. We would like to determine a solution to Monge’s problem in this case, 
namely to minimize 

^ m [/] = I f dp^{x) 


(83) 
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Skipping a detailed explanation of the reasons why [72,73], we just state the result [74] (Brenier map): 
Let and be probability measures on R" whieh are absolutely eontinuous with respeet to the 
Lebesgue measure (volume) vol of R”. Then there exists a eonvex funetion p : R” —R whose 
gradient f = Vp : R” —)■ R"^ pushes forward /i+ to . This map is unique, almost everywhere, 
and it therefore provides a unique solution to Monge’s problem (70). Immediate generalizations are due 
to [75]. In the ease of the Brenier map, the Monge-Ampere equation (72) for the eorresponding densities 
P+, p~ takes the more familiar form 

p+(x) = p~{Vp{x)) det(V^(p(a;)) (84) 

as long as Wp{x) G R"^. The eonvexity of p implies that Jaeobian obeys det(V^<p) > 0 and it also 
guarantees that / is almost everywhere differentiable. To prove this, one ean use the dual Kantorovieh 
formulation (77). The eonstraint in (76) beeomes 

(t){x) + i){y) < ]xi^{x,y) (85) 

Now, changing variables to 

IxP IwP 

v{y) = —-ij{y) (86) 

(85) amounts to 

u{x) + v{y) > X ■ y (87) 

so, (78) reduees to minimizing the funetional 

Ik*[u,v] = / u{x)dp^{x) + / v{y)dp~{y) (88) 

under the constraint (87). By standard results of convex conjugacy theory [76], we know that there is 
such a pair UoiVo minimizing (88) and moreover these two functions are Legendre-Fenchel transforms 
of each other, namely they obey 

Uo{x) = Tnax{x ■ y - Voiy)}, Vo{y) = Tsiax{x ■ y - Uo{x)} (89) 

We will skip the non-trivial topic of the regularity of solutions to these variational problems altogether, 
referring to [35,67] for some statements and an extensive list of references on such issues. 

The question that is raised now is whether the above results can be extended from R"^ to Riemannian 
manifolds (M, g). This was addressed in part by [77-79] and by [80] for metric measure spaces. One 
modification that needs to be made is to replace the Legendre-Fenchel transform (86) by the generalized 
convexity transform 

h^{y) = ini {c{y, x) — h{x)}, h"^{x) = inf {c{x,y) — h^{y)} (90) 

xgM j/SM 

for : M —R U cx) with the cost function c{x,y) having the usual form c{x,y) = \d?{x,y) as 
for R"^. Then Brenier’s theorem [78] is extended as follows. For a closed, connected Riemmanian 
manifold (M, g) let p+ be absolutely continuous with respect to wo/m and p~ arbitrary. Then there 
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is a map 99 : M —R such that = p {ip is a c-concave function) so that the map exp(—V 99 ) 
pushes forward to /r”. Sueh a map is a unique minimizer within the set P(M x M) for Monge’s 
transportation funetional (83). In the language of the 2-Wasserstein spaee 11^2 (M), the Monge transport 
between takes plaee along the unique Wasserstein geodesie joining them. The last statement 

is substantially non-trivial, beeause if M were a general length spaee, there eould be an uneountably 
infinite number of geodesies joining the two measures. 

A seeond modifieation addresses the issue of eonvexity on Riemannian manifolds. There is no 
unique extension of eonvexity from R” to a Riemannian manifold M. Even then, one ean either 
work direetly geometrieally with the spaee at hand, utilising properties of the distanee or the volume 
funetions, or express eonvexity indireetly through funetional inequalities sueh as the Brunn-Minkowski, 
the Braseamp-Lieb, the Prekopa-Leindler etc. In [79,80] the seeond approaeh was taken. Results in 
Comparison Geometry were derived and extensively used. We will state just one sueh result to pave 
the way to the synthetie definition of the generalised Rieei eurvature below. To generalise the linear 
interpolation tx + (1 — t)y between two points x, y G R” to the ease of M with distanee funetion d, 
they [79] start by eonsidering the set 


Zt{x,y) = {z G M, such that d{x,z) = td{x,y) and d{y,z) = {l—t)d{x,y)} (91) 


where t G [0,1] and for lying between x, y. This definition ean be extended from a point y to the 
“target” set F c M as 

Zi(x,F) = \JZ^{x,y) (92) 


y&Y 


If F G M, eonsider the open ball eentered at P of radius r by Bp{r), and introduee a volume distortion 
by the volume ratio 

vol Zt{x,Bp{r)) 


Wt{x,P) = lim 


(93) 


r-5-o vol Bpitr) 

for t 7 ^ 0. Obviously Wi{x,P) = 1, and Wt{x,P) > 1 if the eurvature is non-negative, whereas the 
opposite is true if the eurvature is non-positive. In R"^ elearly Wt{x, P) = 1. An interpretation of (93) 
from a relativistie viewpoint, pretending for a moment that we are working in a spaee of Lorentzian 
signature, sinee light does travel on null geodesies wq{x, P) represents the magnifieation of the area of 
a small light souree loeated near F as is seen by an observer at point x. We eompare this distortion 
funetion (93) with that of the standard n-dimensional spaee forms having eonstant seetional eurvatures 
k > D and A; < 0 respeetively. The Rieei eurvature in all these ease is Ric = {n — l)k. Then for k G R" 
we set 

T- ^>0 

snk{t) = fc = 0 (94) 



Then, if (M, g) is sueh that Ric > {n — l)k along any geodesie of length d{x,y) one has from the 
Bishop-Gromov eomparison theorem (27) 


wt{x,y) > 


snk(td{x,y)) 

snk{d{x,y)) 


n—1 


(95) 
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where equality holds when M has eonstant seetional eurvature k. This result be used in the sequel in the 
synthetie definition of the generalised Rieci curvature. 

As in Otto’s work for R"^, it was noticed that providing lower bounds on the Hessian (63) which is 
related to the energy functional (57), could be useful in determining rates of convergence of gradient 
flows on M to their asymptotic configurations. For the “energy” functional (57) with m = 1 [77] 
calculated its formal Hessian of P(M) and found that it is bounded from below by fcgp (56) as long 
as the generalized Ricci tensor {RN)ij for N = oo of M is bounded from below by kgij. Hence the 
k-convexity, as in (63), of an appropriate energy functional in 'P(M) is intimately related to the a lower 
bound of the generalized Ricci curvature of M. The converse statement, still using the m = 1 case 
of the “energy” functional in (57), was established by [81]. The obvious question is whether these 
considerations can be extended to the cases of the “energy” functional in (57) for m 7 ^ 1 and how would 
this influence the definitions of the generalized Ricci curvature. An answer was provided by [34,82-86]. 


3.4. Displacement Convexity and Synthetic Definition of the Generalized Ricci Curvature 


In this subsection we follow [34,86] and can be even more general than before by considering, not 
necessarily smooth, metric measure spaces (X, d, u). Here u is Borel will be considered as a “reference 
measure”. For Riemannian manifolds M the role of u is played by the volume, but a generic metric 
measure space lacks such a natural choice. It turns out that without substantial loss of generality, the 
constructions remain largely the same if we only consider measures p, absolutely continuous with respect 
to u with Radon-Nikodym density p, namely p = pu. The general case is discussed in [34,86]. The 
first step is to consider a function U : [0, 00 ) —)■ R which is continuous and convex. Second, given 
p,u e 'P(X), define the functional Uv{p) by 


Uu{p) = [ U{p{x))du{x) (96) 

Jx 

The role of these definitions is to allow us to construct “energy functionals” for a gradient flow akin to 
(57). Actually, from a physical viewpoint these are the entropy functionals Sq and Sbgs in the first and 
second lines of (57) respectively. Hence the original assumptions on the function U. To recover (57), 
consider the single-parameter family of functions Un ■ [0, 00 ) —R with N E [1, cxd], which is given by 


Then 


Upon setting 


UN{r) 


Nr[l-^), if X e ( 1 , 00 ) 
rlogr, a N = 00 


U, 


Nm 


N — N J^p^ N du, if N E (1, cxd) 
p log p do, if N = 00 


N 


1 

1-q 


(97) 


(98) 

(99) 


in (98), we recover (57), or equivalently (2), (3). This also fits nicely with a standing assumption of our 
prior work such as [87-90] in which we have examined properties of Sg for values of the non-extensive 
parameter q E [0,1] which are equivalent to N E [1, 00 ]. 
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At this point, one can ask what additional properties should the functions U obey so as to qualify as 
being the foundation of the functionals (96) which should be “reasonable”. Since functionals such as (96) 
can be interpreted as entropies, one can search for desirable properties that thermodynamic potentials 
possess [6,25,26,91] and demand these to hold in (96) or equivalently for the functions U. This motivated 
the introduction of the displacement convexity classes VCn and VCoo by [92]. A continuous, convex 
function belongs to the displacement convexity class VC^, i.e., U G VCn, for N G [1, oo), if 

«(() = t«u (T) (100) 

is convex on (0, cx)). Moreover, a continuous, convex function belongs to VCoo, U G VCoo, if 

u(t) = e*(7(e-‘) (101) 


is convex on (— oo, +oo). We can see that, if < N', then VC^' C VC^. More importantly, for our 
purposes, consider the following two functions that can be derived from U if it is sufficiently smooth, 
which have the form of Legendre transforms 

pir) = r — Uir) (102) 

dr 

which is motivated by the definition of the pressure, if U were to be the internal energy, at the level of 
functions rather than functionals. Moreover consider the “iterated pressure” 


P2{r) 


dp{r) 

dr 


p{r) 


(103) 


Then, with the above assumptions on U and also assuming that it is smooth, we have that U G VCn if 
and only if the function 


p(r) 

T N 

is non-decreasing on (0, cx)). Equivalently U G VC^ if and only if 


(104) 


P2(r) > 


p{r) 

~w 


(105) 


Among them, (104) shows the role of the function (97) giving rise to (98) and equivalently (57), (2) in 
VCn- This function (97) is by no means unique. However it has the best possible asymptotic behaviour 
that preserves the convexity property (100) defining VCn- Something similar can be said about the class 
VCoo'- if (7 G VCoo, then either U is linear or there exist a, 6 > 0 so that U{r) > ar logr — br. The 
latter functional form results to Sbgs for a probability density p. From the present viewpoint therefore, 
the function (97) giving rise to (98) is not unique at all, but has a very desirable functional form for 
membership in VC^ and VCoo that moreover gives rise to the entropic functionals (2), (3). 

As was also hinted in the last paragraph of Section 3.4, the definition of lower Ricci curvature bounds 
for a metric measure space is expressed via bounds on the convexity properties of functionals arising 
from functions belonging to VC^ or VCoo, such as (97). The “amount of convexity”, an example of 
which is shown in (63), is encoded in the definition of fc-convexity. Convexity of a smooth function / on 
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the unit interval t G [0,1], whieh will be assumed to parametrize a geodesie in (X, d, o), with x, ?/ G X 
ean be determined either by using the loeal eriterion ^ > 0 or by Jensen’s inequality 

f{tx + {l-ty)) < tf{x) + {l-t)f{y) (106) 


By the same token, for fc-eonvex funetions, one ean either use the eriterion ^ > k or the inequality 

f{tx + {l-t)y) < tf{x) + {l-t)f{y)-^kt{l-t)(f{x,y) (107) 

Relying on these, and for fc G R, a funetional with baekground measure u is ealled 


• fc-displaeement eonvex if for any two po, pi G 'P(X) and for all Wasserstein geodesies {pt}, t G 
[0,1], we have 


Ut,{pt) < tUi,{pi) + {1-t)U„{po)-^kt{l-t)W2{po,Pi) 

for all t G [0,1] 


(108) 


• weakly /c-displaeement eonvex, if for all po, pi G 'P(X) there is at least one Wasserstein geodesie 
along whieh (108) holds. 


R U {— cxd} by 


follow very elosely [34,86]. 

For any A: G R, define A : 

: T)Coo 

1 

f fclim^^o^, 

if A; > 0 


A{U) = 


if A: = 0 

(109) 

1 

[ A:lim,.^oo^, 

if A: < 0 



We say that for sueh a fc G R, the metrie measure spaee (X, d, u) has N = oo generalized Rieei eurvature 
bounded from below by k if the Wasserstein spaee R 2 (X) is weakly A((7)-displaeement eonvex, namely 


Uy{pt) < tUy{pi) + {I -t)Uu{po) -t)Wl{p0,Pl) (110) 

for any two measures Pq, pi G 'P(X), U G VCao and all t G [0,1]. For the finite N ease we need a few 
more definitions: given again A: G R and for = 1, define 


for N G (1, oo] and 


In addition, define 


/3t{xo,xi) 


/3t{xo,xi) 


oo, if A: > 0 
1 , if A: < 0 


a 


[JK 

\l N- 


d{xo,xi) 


exp (|A;(1 - t‘^)d‘^{xo,Xi)) , 

OO, 

■ A. 

sin(rQ:j \ 
t sin a I ’ 

1 , 

■ ^ \ 
sinh(cQ:) \ 

t sinh OL I ’ 


if = oo 

if N < oo, k > 0, a > TT 
if A^ < oo, A: > 0, a G [0, tt] 
if A^ < oo. A: = 0 
if N < oo, k < 0 


( 111 ) 


( 112 ) 


(113) 
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It should be noted that the last three entries of (113) are the same as the right-hand-side of (95) from 
which they originate. Now consider a transference plan r; G P(X x X) and decompose it in terms of its 
marginals po and pi 


dr]{xo,Xi) = dr]{xi\xo) dpo{xo) = dr]{xo\xi) dpi{xi) (114) 

where dri{xo\xi),dri{xi\xo) indicate the corresponding “conditional” measures. Then, the metric 
measure space (X, d, u) has iV-Ricci curvature bounded below by k, if there is some optimal transference 
plan T] from po to pi with Wasserstein geodesic pt, so that for all U G VCn and for all t G [0,1] 

U,{pt) < (1 - t) /x^x 

+ ^ (115) 

Even though this is a far from trivial statement, one can see it as a weight-averaged version of (107) 
with weights provided by measure ratios akin to the right hand side of (95) embedded into the entropy 
functionals whose convexity properties in W 2 (M) are used to reflect the Ricci curvature properties of M, 
all in a comparison sense. We observe that the definition (115) is synthetic: nowhere have we required 
the metric measure space to be smooth. Its apparent drawback for applications to Physics is that this 
definition of Ricci curvature is only defined in a comparison sense. However, it is directly related to the 
convexity properties of Sg in the Wasserstein space 142 (X). On the other hand, the definition (41) of the 
generalised Ricci tensor/curvature in smooth metric measure spaces is local, so it is easy to compute, in 
principle, but it appears to have nothing to do with Sg. However (41) and (115) give the same result, in 
a comparison sense, for a measured length space, hence for a Riemannian manifold: The Riemannian 
manifold (M, g) has its generalised Ricci curvature bounded below by k (115) if and only if [ 86 ] its 
A^-Ricci tensor (41) is also bounded below by k, namely {RN)ij > kgij. Before closing this Section, 
it may be worth noticing that for non-branching spaces such as Riemannian manifolds, there is no real 
distinction between an element of the displacement convexity class VCn and (97), so in this sense the 
entropic functional Sg itself is unique in the determination of the generalised Ricci curvature (115). 

4. Isoperimetric Interpretation of the Non-extensive Parameter and Related Matters 

We can look closer at (99) which provides a relation between the free parameter N in the definition 
of the generalized Ricci tensor (41) and the non-extensive parameter q in the functional (2) under the 
additional tacit restriction to g G [0,1]. Inverting (99) we get 

N-1 

This points out to an interpretation of q through N: N can be interpreted as an effective isoperimetric 
dimension of the measure p on the smooth metric measure space (M, g, p). It should be noted that the 
isoperimetric inequalities [24,93] (“why is a soap bubble spherical”), may arguably be the most important 
and influential among all the geometric inequalities [94]. For this reason, such an interpretation of q 
allows us to use the very extensive set of techniques and results that are known on this topic, mostly in a 
comparison sense, in reaching conclusions that may be of physical interest. It may be worth mentioning 
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at this point that the isoperimetric dimension is an asymptotie property of a spaee and ean be larger than 
its Hausdorff dimension [93], even if such a space is a Riemannian manifold: consider for instance the 
hyperbolic n-space whose Hausdorff dimension is n, but whose isoperimetric dimension is infinite, as 
volumes and areas of the boundaries of subspaces of hyperbolic spaces are related by linear isoperimetric 
inequalities. 

However, it becomes immediately obvious, that such an interpretation has a problem: indeed, even 
if we confine our attention to g G [0,1], it seems through, mostly, data fittings for finding q that the 
corresponding N is quite small, and certainly finite, for most systems. After all g is a thermodynamic 
parameter, so it is to be expected that its value in actual systems is finite, or even “relatively” small. 
On the other hand, for Hamiltonian systems of many degrees of freedom, their configuration and 
phase spaces have dimensions of order of magnitude n ~ 10^^ which represents a gross mismatch 
with the values of N computed through (116) and from data fittings of g. We have not been able to 
resolve this interpretational conundrum in an acceptably convincing way. The best that we can currently 
state, is to vaguely suggest that N should actually represent an effective isoperimetric dimension of the 
configuration or phase space per degree of freedom, something more akin to ^ in our notation. Given 
this, it might be of some interest to attempt to explain the appearance of the escort distributions [6,95] 
used in the calculations of the thermodynamic parameters [6], under this light. 

The Bishop-Gromov generalised comparison theorem (43) can also provide quantitative support for 
the relation between (41) and (2). It may be worth noticing, that according to [96,97], Sg describes 
the cases of systems whose configuration or phase space volume increases in a power-law, rather than 
an exponential, manner. Obviously checking whether this is true explicitly, in specific examples (of 
Hamiltonian systems of many degrees of freedom) seems to be quite hard, if feasible at all. However 
the generalized Ricci curvature (41) provides a feasible, at least in principle, way of doing so by using 
the Bishop-Gromov comparison result (43): if Rn > 0 then, according to (43), the corresponding 
measures on the configuration / phase space increase slower than a power-law fashion with exponent 
equal to N. This also provides another dimensional interpretation of N as the maximal Hausdorff 
dimension exponent bounding the expansion of measures in the configuration or phase space of the 
microscopic system. 

There is a slightly different way of understanding the effect of lower bounds of (41); it is in a way, a 
geometric analogue to the construction of the canonical ensemble in equilibrium Statistical Mechanics. 
In this treatment, proposed in [49], one initially considers the system of interest, let’s call its configuration 
/ phase space by B coupled to a “thermostat” F which can be large but not infinite (F is compact). Let 
the combined system’s configuration/phase space be indicated by M. Naturally, the evolution in M 
is Hamiltonian along geodesics chosen with respect to the metric of M. Project this evolution on M 
“down" to the system of interest B assuming that the tangents to the geodesic curves on M and B have 
equal lengths with respect to the respective metrics. This requirement essentially amounts to assuming 
that the average kinetic energy per degree of freedom of M, which might be called “temperature" but it 
is far from obvious that it would have a physical meaning for generic systems, is the same as that of B. 
Then this projection p : M —B is actually a Riemannian submersion [98]. Moreover, assume that the 
pushforward pjj of dvolM is just a multiple of dvol-e and let N = dim F. Let the Ricci curvatures of M, B 
be indicated by superscripts with respect to their corresponding metrics. Then for any /c G R, [49] proves 
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that if Ric^ > k, then Ric^ > k. So, a way to understand the meaning of the generalized Rieei tensor is 
to see it as the Ricci tensor due to the submersion of a higher dimensional space preserving the measure 
of the base up to a multiplicative constant. This statement may also make the generalised Ricci tensor 
quite useful for applications in theories involving higher (greater than 4) dimensional space-times. The 
result and the general ideas are also close to the treatment of [99,100] who expressed the non-extensive 
parameter q in terms of the scaling properties of the Hamiltonians of the “thermostat” F and of the 
system under study B. 

There is the also the obvious question of how would one go about dealing with cases of systems having 
g > 1. Whether Hamiltonian systems of many degrees of freedom can be described by Sq, g G R 
was taken for granted for a considerable amount of time. More recently though, some dissenting 
opinions have appeared (see for instance [13,15]). We will not go into this very important matter 
which deserves a thorough investigation, in the present work. Let’s assume for argument’s sake that 
the “conventional wisdom” is true and Sq can describe such Hamiltonian systems. Then a possible way 
around the restriction g G [0,1] that we use in this work may come from “dualities” of the non-extensive 
parameter. It has been observed for a while [6], that systems described by Sq seem to have particularly 
nice properties under the following transformations/“dualities" of the non-extensive parameter 



Q 


These are a set of generators of conformal/Mobius transformations on the complex plane, assuming that 
g G C. Their existence may be allow us to extend the validity of elements of the formalism presented 
here outside the restricted range g G [0,1] that we have assumed. Whether or to what extent such a 
goal can actually be achieved, even formally, its implications and the origin of the dualities (117) is the 
subject of one of our current investigations [101]. 

5. Assessment and Omissions 

In this work, we have attempted to bring to the attention of our audience the existence, construction 
and geometric significance of the generalised (Bakry-Emery-) Ricci tensor (41) as an analytic tool for 
probing the microscopic dynamics on the configuration or phase space for systems whose collective 
behaviour is described by the Havrda-Charvat/Daroczy/Cressie-Read/Tsallis entropy (2). Such a 
straightforwardly computable geometric quantity may be of some interest in performing concrete 
calculations in specific models, calulcations that may shed some light in the dynamical aspects [102] 
of the underlying systems of many degrees of freedom [103] described by such functionals. Our view is 
that this generalised Ricci tensor may do for systems described by Sq what the ordinary Ricci tensor has 
done and can still do for Sbgs [102,103]. The generalized Ricci curvature, which has a purely synthetic 
definition (115) without resorting to differential properties, may also be of some interest to the Quantum 
Gravity community for formulating the Einstein equations in a synthetic rather than in a differential way, 
since it is widely believed that differentiability and smoothness, of space-time should be an emergent, 
rather than an a priori assumed, property. 

We would like to point out that there is very little, if any at all, new material in this work. Ear more 
extensive and authoritative treatments can be found in the literature, such as [35]. Our goal was to help 
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motivate and make somewhat more familiar, from a physieal viewpoint, some ideas that lie behind the 
eonstruetion and properties of the generalised Rieei eurvature. We think that bringing sueh an objeet 
to the attention of the praetitioners of “non-extensive” entropy may ereate some interest whieh may 
eventually help elueidate, analytieally, issues pertaining to the underlying dynamies of systems deseribed 
by Sq. In this spirit, we have omitted entirely from the present work any diseussion whatsoever about 
the regularity and the eonvergenee (usually in the measured Gromov-Hausdorff sense) of the underlying 
struetures. Sueh important issues, and of potential physieal relevanee, ean be found in [35], in the original 
papers or in reeent reviews on these topies. 
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